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For simplest colored branching processes we prove an analog to the McMillan 
theorem and calculate Hausdorff dimensions of random fractals defined in terms 
of the limit behavior of empirical measures generated by finite genetic lines. 
In this setting the role of Shannon's entropy is played by the Kullback-Leibler 
divergence and the Hausdorff dimensions are computed by means of the so-called 
£f) . Billingsley-Kullback entropy defined in the paper. 
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Let us consider the finite set X = {l,...,r}, whose elements denote different 
colors, and a vector (yu(l), . . . , fi(r)) G [0,1]. A simplest colored branching process 
can be denned as an evolution of a population in which all individuals live the same 
fixed time and then, when the lifetime ends, each individual generates (independently 
of others) a random set of "children" containing individuals of colors 1, . . . , r with 
probabilities /i(l), /i(r) respectively. We will suppose that the evolution starts 
with a unique initial individual. It is suitable to represent this process as a random 
genealogical tree with individuals as vertices and each vertex connected by edges with 
its children. Denote by X n the set of all genetic lines of length n (that survive up to 
generation n). The colored branching process can degenerate (when it turns out that 
starting from some n all the sets X n are empty) or, otherwise, evolve endlessly. Every 
genetic line x = (x%, . . . ,x n ) G X n generates an empirical measure 5 x ^ n on the set of 
colors X by the following rule: for each % G X the value of 8 Xjn (i) is the fraction of 
those coordinates of the vector (x\, . . . ,x n ) that coincide with i. 

Let v be an arbitrary probability measure on X. The analog to the McMillan 
theorem that will be proved below asserts that under condition of nondegeneracy of 
the colored branching process the cardinality of the set {x G X n \ 5 Xyn ~ u} has an 
almost sure asymptotics of order e~ np<yU ^\ where 



f<0 
/*(<) 



Formally, the value of p[y, \x) coincides with the usual Kullback-Leibler divergence 
and differs from the latter only in the fact that in our setting the measure \x is not 
probability and so p(u, p) can be negative. 
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In the paper we investigate also random fractals defined in terms of the sequence 
of empirical measures 5 x>n limit behavior. Let Xoo be the set of infinite genetic lines. 
Fix an arbitrary vector 6 = (0(1), . . . , 6(r)) G (0, l) r and define the following metrics 
on Xoo: 

n 

dist(x, y) = Y\_9( x t), where n = inf {t \ x t ^ y t } — 1. 

t=i 

Denote by V any set of probability measures on X. It will be proved, in particular, 
that under the condition of nondegeneracy of the colored branching process 

dim H {x G X^ \ 5 x , n ->• V} = sup d(v, fx, 9) 

vev 

almost surely, where d(u, /z, 6) is the Billingsley-Kullback entropy defined below. 

The paper can be divided into two parts. The first one (sections 1-5) contains 
known results; some of them have been modified in a certain way for the convenience 
of use in what follows. Anyway, most of them are proved below for the completeness 
and convenience of the reader. The second part (sections 6-9) contains new results. 

In addition, we note that all the results of the paper can be easily extended to 
Moran's self-similar geometric constructions in R n , but we will not do that. 



1 The spectral potential 

Let X be an arbitrary finite set. Denote by B(X) the space of all real- valued functions 
on X, by M(X) the set of all positive measures on X, and by M\(X) the collection of 
all probability distributions on X. 

Every measure fj, G M(X) determines a linear functional on B(X) of the form 



It is easily seen that this functional is positive (i.e., takes nonnegative values on non- 
negative functions). If, in addition, the measure fi is probability then this functional 
is normalized (takes the value 1 on the unit function). 
Consider the nonlinear functional 

\(i P ,H)=]nii{e' fi ], (1.1) 

where if G B(X) and /i G M(X). We will call it the spectral potential. Evidently, 
it is monotone (if <p ^ ip then \(ip,fi) ^ \(i/j,fi), additively homogeneous (that is, 
\(ip + t,fi) = \(<f, /i) +t for each constant t), and analytic in ip. 

Define a family of probability measures fi^ on X, depending on the functional 
parameter ip G B(X), by means of the formula 

M/] = / e B(X). 

Evidently, each measure /i^ is equivalent to /i and has the density e^^'^ with respect 
to /i. 
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Let us compute the first two derivatives of the spectral potential with respect to 
the argument ip. Introduce the notation 



A'fo ii)\f\ 



d\(<p + tf, n) 



dt 



t=o 



This is nothing more than the derivative of the spectral potential in the direction / at 
the point (p. An elementary computation shows that 



d\nfi[e^ +tf ] 



dt 



t=o 



;i.2) 



In other words, the derivative X'((p, fi) coincides with the probability measure fi^. Then 
put 

d 2 X{p + tf + sg,fi) 



A'W) [f,g] 



ds dt 



s,t=0 



and compute this derivative using just obtained formula (11.21) : 



A'W) [/,<?] 



d f^ +S9 f] 



ds V /4e^ +s f] 



s=0 



In probability theory the expression /i^ft] is usually called the expectation of the 
random variable / with respect to the probability distribution and the expression 
Pvifd] ~ AvL/1 /VM * s cane d the covariance of random variables / and g. In particular, 
the second derivative 



d 2 \(p + tf,fi) 



dt 2 



M/VM/] 2 = M(/-M/D 2 ] 



t=0 



is equal to the variance of the random variable / with respect to the distribution 
Since the variance is nonnegative it follows that the spectral potential is convex in ip. 



2 The Kullback action 

Denote by B*(X) the space of all linear functionals on B(X). Then, obviously, 

Mi(X) C M(X) C B*(X). 

The following functional of two arguments v G B*(X) and /i G MiX) will be called 
the Kullback action: 

(fi[(p\n<p] = v\hiip], if v G Mi(X) and v = ipfi, 
WW = i • n ( 2 - 1 ) 

[+oo m all other cases. 

To be more precise, the "all other cases" fit into at least one of the three categories: 
a) singular w. r. t. (i probability measures u, b) nonnormalized functionals u, and 
c) nonpositive functionals v. 

In the literature, as far as I know, this functional have been defined only for prob- 
ability measures v and \x. Different authors call it differently: the relative entropy, the 
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deviation function, the Kullback-Leibler information function, the Kullback-Leibler 
divergence. 

When v is a probability measure the Kullback action can be defined by the explicit 
formula 

pM = £^)ln^. (2-2) 

In particular, if p(x) = 1 then the Kullback action differs only in sign from Shannon's 
entropy 

ff(i/) = -^i/(x)lni/(x). (2.3) 

In the case of probability measure p the Kullback action is nonnegative and vanishes 
only if v = p. Indeed, if the functional v is not an absolutely continuous with respect 
to p probability measure then p{y,p) = +oo. Otherwise, if v is a probability measure 
of the form v = ip/j, then from Jensen's inequality and strong convexity of the function 
f(x) = xlnx it follows that 

^^)=*)]>/W) = o 

(so long as p[<p) = u[l] = 1), and the equality p{u,p) = holds if and only if is 
constant almost everywhere and, respectively, v coincides with p. 

Every measure p G M(X) can be put down in the form p = cpi, where c = p[l] 
and p x G M X (X). If v G M X (X) then (J22D implies 

= p{v,pi) - lnc ^ -ln/i[l]. (2.4) 

In case v ^ Mi(X) this inequality holds all the more since the Kullback action is 
infinite. 

Theorem 2.1 The spectral potential and the Kullback action satisfy the Young in- 
equality 

p{v,ft)>u[iP]-\{iP,p), (2.5) 

that turns into equality if and only if v = p^. 

Proof. If p(u, p) = +oo then the Young inequality is trivial. If p(z/, p) < +oo 
then by the definition of Kullback action the functional v is an absolutely continuous 
probability measure of the form v = ipp, where if is a nonnegative density. In this case 

X{ip,p) = \np[e^} ^\nje^dp = lnj e^~ ln * ' dv = lnz/[e^" lnv ] ^ u[ip-ln(p) 

(at the last step we have used Jensen's inequality and concavity of the logarithm 
function). Since p(v,p) = v\ln(p], this formula implies inequality (12. 5p . 
Recall that p^ = e^~ x ^'^ p. So if v = p^ then by definition 

p{v,p) = u[ip - X(tfj,p)} = u[ip] - X(ip,p). 

Vice versa, assume that p(z/, p) = v[ij;] — X(i[), p). Then subtract from the above equality 
the Young inequality piy.p) ^ v[ip] — X(<p,p). We obtain 

\(<p,p) -X(ip,p) ^ u[ip-ip\. 

From this follows that v = X'(ip,p). Finally, X'(ip,p) coincides with p^. □ 
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Theorem 2.2 The Kullback action p(u, p) is the Legendre transform w. r. t. v of the 
spectral potential: 

p{v,n)= sup {u[if)]-X(if>,fi)}, ueB*(X), /i G M(X). (2.6) 

4>£B(X) 

Proof. By the Young inequality the left hand side of (12. 6 p is not less than the right 
one. Therefore it is enough to associate with any functional v G B*(X) a family of 
functions ipt, depending on the real- valued parameter t, on which the equality in (I2.6P 
is attained. 

At first, suppose that v is an absolutely continuous with respect to /i probability 
measure of the form v = ipp, where <p is a nonnegative density. Consider the family of 
functions 

\mp(x), > 0, 

—t, <^(x) = 0. 



When t — > +oo we have the following relations 

/i[e^*] = J \p dp + J dp — > J cpdp — 1, 

v \ > \ ) i\ = J tplncpdfj, + J — tip dp = p[cp hicp), 

u[ipt\ - \(ip t ,v) = -ln/ife^] — > ti[<pln<p] = p(v,/J>), 

and so (12.61) is proved. 

In all the other cases, when v is not an absolutely continuous probability measure, 
by definition piv^p) = +oo. Let us examine this cases one after another. 

If v is a singular relative to p probability measure, then there exists xq G X such 
that p(xo) = and v{xq) > 0. In this case consider the family of functions 

t, x = x , 

0, X ^ Xq. 

It is easily seen that 

u[ip t ] - A(^,/i) > tu(xo) - ln/xfe^] ^ tu(x ) - ln/xfl]. 

The right hand side of the above formula goes to +oo while t increases and (I2.6P holds 
again. 

If the functional v is not normalized then put ip t = t. Then the expression 

i/[Vt] - \{ik,fi) = v[t] - In/i[e*] = t(v[l] - 1) - ln/i[l] 

is unbounded from the above and hence (12.61) is still valid. 

Finally, if the functional v is not positive then there exists a nonnegative function 
(p such that u[(p] < 0. Consider the family ip t = —t<p, where t > 0. For it 

v[ip t ] - \(ip t ,v) ^ -tv[(p] - X(0,p) — > +oo 

as t — y -|-oo, and (I2.6P remains in force. □ 

Corollary 2.3 The functional p( ■ , p) is convex and lower semicontinuous on B*(X). 
Proof. These are properties of the Legendre transform. □ 
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3 The local large deviations principle and 
the McMillan theorem 

As above, we keep to the following notation: X is a finite set, B(X) stands for the 
space of real- valued functions on X, B*(X) is the space of linear functionals on B(X), 
M\(X) is the set of all probability measures on X, and M(X) is the set of all positive 
measures on X. 

To each finite sequence x = (x±, . . . ,x n ) G X n let us correspond an empirical 
measure 5 XyU G Mi(X) which is supported on the set {xi, . . . , x n } and assigns to every 
point Xi the measure 1/n. The integral of any function / with respect to 6 XiTl looks like 

c m fM + ••• +f{x n ) 



n 

Denote by p n Cartesian power of a measure p G M(X), which is defined on X n . 

Theorem 3.1 (the local large deviations principle) For any measure p G M(X), 
any functional v G B*(X), and e > there exists a neighborhood 0{u) such that 

p n {x G X n | 6 Xtn G 0{v)} ^ e - n{pM - £) . (3.1) 

On the other hand, for any e > and any neighborhood 0{y) the following asymptotic 
estimate holds: 

fi n {x G X n | S x , n G 0(u)} ^ g -<pM+^) i n^oo. (3.2) 

If p(v,f-i) = +oo, then by the difference p{y,n) — e in (13. ip we mean an arbitrary 
positive number. 

In the case of probability measure p Theorem 13.11 is a partial case of Varadhan's 
large deviations principle (which explicit formulation can be found, e. g., in 0] and |14j). 
Therefore, this theorem can be deduced from Varadhan's large deviations principle by 
means of mere renormalization of p. Nevertheless, we will prove it independently for 
the purpose of completeness. 

Proof. By Theorem 12.21 for any e > there exists ip B(X) such that 

p(v,li)-e/2 < u[iP]-\(iP,v). (3.3) 
Consider the probability measure p,^ = e^~ x ^'^p. Obviously, 

d ^ n ( X ) = TT dp(Xj) = TT ^X(ip,n)—ip(xi) = e t»(A(^)-*.,nM) (3 4) 

Define a neighborhood of the functional v as follows: 

0{y) = {5 G B*(X) | S[if>] > u[i/j)-e/2}. 
Then it follows from ( 13. 4p and ( 13. 3 p that under the condition 5 x ^ n G 0(u) 

dpl(x) 



Consequently, 

p n {x G X n | S x>n G 0(v)} = J dp n (x) ^ J e n{ - pM+e) dp%(x) ^ e -"«M-e). 

Thus the first part of Theorem 13.11 is proved. 

The estimate (13. 2p is trivial if p(u,fj) = +00. So it is enough to prove it only in 
the case when v is a probability measure of the form v = ipp and the Kullback action 
p{v,n) = v\ln ip] is finite. Fix any number e > and neighborhood 0{v). Define the 

sets 

Y n = {x G X n I S x>n G 0(u), \5 X)n [\n<p] - v\\.n<p\\ < e/2} 

(the last inequality in the braces means that <p(xi) > at each point of the sequence 
x = (xi, . . . , £„)). Note that for x G Y n 

dp U (x) _ TT dp(Xj) _ T-r 1 = e -n5 x , n [lncp} > e -n(v[\n^]+e/2) 
dv n [x) dv[Xi) <fi(Xi) 

Consequently, 

p n (Y n ) = j dp n {x)^ j e- n{u[ln * ]+£,2) dv n (x) = e~ np{v ^- ne/2 v n {Y n ). (3.5) 

J Y n J Y n 

By the Law of large numbers v n {Y n ) — > 1. Hence (13. 5p implies (13. 2p . □ 

Corollary 3.2 (the McMillan theorem) For any probability measure v G M\{X) 
and e > there exists a neighborhood 0(v) such that 

#{x = ( Xl , ...,x n )eX n \ S x , n G 0{y) } < e n ^ +£ \ 

On the other hand, for any neighborhood 0(v) and e > 

#{a; G X n I 6 x>n G 0{u)} ^ e n(HM " e) as n -> 00. 

i^ere i?(^) denotes Shannon's entropy defined in (12. 3|) . 

Proof. This follows from equalities (12. 2p . (12. 3p . and the previous theorem, if we set 
p{x) = 1 for all x G X. □ 



4 Hausdorff dimension and the maximal dimension 
principle 

Let us define the Hausdorff dimension of an arbitrary metric space Q. 

Suppose that Q is covered by at most countable collection of subsets U = {Ui}. 
Denote by \U\ the diameter of this covering: \U\ = sup \ Ui\, where \Ui\ is the diameter 
of Ui. For every «6l put 

mes(W,a) = \ u i\ a - 
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The Hausdorff measure (of dimension a) of the metric space Q is 



mes(f2,a) = lim mes(U, a), 
|W|-*o 

where U is at most countable covering of Q. Obviously, 

mes(W, p) < mes(W, a) \Uf~ a if (3 ^ a. 

This implies the following property of the Hausdorff measure: if mes(f2,a) < oo for 
some a, then mes(f2, (3) = for all (3 > a. 

The Hausdorff dimension of the space Q is the number 

dim#f2 = inf{a | mes(f2,a) = 0}. (4.1) 

In other words, dim# Q = «o if mes(f2, a) = for all a > «o and mes(f2, a) = oo for 
all a < otQ. 

Below we will consider the space of sequences 

X N = {x = (xi, x 2 , x 3 , . . . ) }, Xi G X = {1, . . . , r}. 

Let x = (xi, X2, ■ ■ ■ ) G X N . Denote by Z n [x) the set of sequences y = (y±, y 2 , . . .) 
whose first n coordinates coincide with the same coordinates of x. This set will be 
called a cylinder of rank n. The collection of all cylinders generates the Tychonoff 
topology on the space X N and the cylinder a -algebra of subsets in X n . 

Take an arbitrary positive function rj on the set of all cylinders that possesses 
the following two properties: first, if Z n [x) C Z m (y) then r](Z n (x)) ^ r](Z m (y)) and, 
second, rj(Z n (x)) — )■ as n — > oo at each point x G X n . Define the cylinder metrics 
on X N by means of the formula 

dist(x,?/) = t](Z n (x)), where n = max{m | Z m {x) = Z m (y)}. (4.2) 

Evidently, the diameter of Z n (x) in this metrics coincides with 7](Z n (x)). 

Suppose on X N , besides the cylinder metrics (14. 2ft . a Borel measure [i is given. The 
function 

ln/x(Z n (x)) 
c^(x) = hm , n| 

n->oo m l^n^JI 

is called (lower) pointwise dimension of the measure fi. 

The next theorem provides an effective tool for computing the Hausdorff dimensions 
of various subsets of X N . 

Theorem 4.1 Suppose A C X N . If there exists a finite Borel measure fi on X N such 
that d^x) ^ d for each point x G A, then dim^ A ^ d. On the contrary, if d^(x) ^ d 
for each i6i and the outer measure fi*{A) is positive, then dim^ A ^ d. 

It follows that if d^(x) = d on the whole subset A C X N then its dimension is equal 
to d. 

A weakened version of the second part of Theorem 14.11 in which the condition 
dfj,(x) ^ d is replaced by the more strong one (j,(Z n (x)) ^ \Z n (x)\ d is usually called the 
mass distribution principle. 
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Proof. Every cylinder Z n (x) is, in fact, a ball in the metrics (14. 2p . whose radius 
equals to its diameter, and vice versa, any ball in this metrics coincides with a cylinder. 
Besides, any two cylinders Z n (x) and Z m (y) either have empty intersection or one of 
them is embedded into other. Therefore, while computing the Hausdorff measure and 
dimension of a subset A C X N it is enough to operate with only disjoint coverings of 
A by cylinders. 

Suppose first that d^x) < a for all points x £ A. Then for each x £ A there 
exist arbitrarily small cylinders Z n (x) satisfying the condition |Z n (x)| a < fi(Z n (x)). 
Using this kind of cylinders we can put together a disjoint covering U of the set A of 
arbitrarily small diameter. For this covering we have the inequalities 

mes(W,a)= ^ \Z n (x)\ a ^ ^ ^ M (X N ), 

z n (x)eu z n (x)eu 

and hence dim# A ^ a. Thus the first part of the theorem is proved. 
Suppose now that d^(x) > a for all points x £ A. Define the sets 

A £ = {x £ A | \Z n (x)\ a > fj,(Z n (x)) whenever \Z n (x)\ < e}. 

Obviously, A = [j £>Q A e . Hence there exists an e such that fi*(A e ) > 0. Let U be be a 
disjoint covering of A by cylinders of diameters less than e. From the definition of A £ 
it follows that mes(W,a) ^ fi*(A £ ). Therefore diuiH A ^ a, and thus the second part 
of the theorem is proved. □ 

Theorem 14.11 was first proved by Billingsley in the case when the function r\ in (14. 2 p 
is a probability measure on X N (see J3j Theorems 2.1 and 2.2]). An analog to this 
theorem for subsets icff was proved in [15] and [13] . 

Each point x = (xi,X2, ■ ■ ■ ) £ X N generates a sequence of empirical measures 5 x , n 
on the set X: 

r / -\ #{i I x t = i, t ^ n} 

o x ,n{v = > 1 e x - 

n 

In other words, 5 Xtn (i) is the fraction of those coordinates of the vector (xi, . . . ,x n ) 
that coincide with i. 

For every probability measure v £ Mi(X) let us define its basin B(y) as the set of 
all points x £ X N such that 5 Xtn converges to v. 

Evidently, basins of different measures do not intersect each other and are nonempty. 
If x £ B(u), and y £ X N differs from x in only finite number of coordinates, then 
y £ B{v). This implies density of each basin in X n . 

Every measure v £ M\{X) generates Bernoulli distribution P u = u m on the space 
X N . By the strong law of large numbers the basin B{v) has probability one with 
respect to Bernoulli distribution P u , and its complement has zero probability P v . In 
particular, any basin different from B[y) has zero probability. 

Points that does not belong to the union of all basins will be called irregular. The 
set or irregular points has zero probability with respect to any distribution P u , where 
v £ M\{X). As a result, X N turns out to be decomposed into the disjoint union of 
different basins and the set of irregular points. 

Let us fix some numbers 6{i) £ (0, 1) for all elements i £ X = {1, . . . , r}, and define 
a cylinder 9-metrics on A N by the rule 

n 

dist{x, y) = Y[6{x t ), n = mf{t\x t ^y t }-l. (4.3) 

i=l 
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It is a partial case of the cylinder metrics ( I4.2p . 

For each measure v £ M\(X) and ^-metrics (14. 3 p define the quantity 

a\y,v) - — ? — - (4.4) 

We will call it the Billingsley entropy because he was the first who wrote down this 
formula and applied it for the computation of Hausdorff dimensions [2] . He expressed 
also this quantity in terms of Shannon's entropy and the Kullback action: 

S(v,9) 



H(u) + P (u,ey 

Theorem 4.2 Hausdorff dimension of any basin B(y) relative to the 9 -metrics f!4.3[) 
is equal to the Billingsley entropy S(u,9). 

A partial case of this theorem in which 6(1) = . . . = 9(r) = 1/r was first proved by 
Eggleston [5]. In the complete form this theorem and its generalizations were proved 
by Billingsley in [2], [3] . 

Proof. Assume first that f(i) > for every i — 1, . . . , r. Obviously, 

\nP u (Z n (x)) _ ^" =1 lnz/(x t ) _ Yn=i n &x,n(i) lnz/QQ 
ln|Z n (x)| Y.t=i ln0 ( x t) Y. r i=i nS x,n( i ) ln0 ( i )' 
Hence for each point x & B(u) we have 

dp (x) = lim lnP ^ Z "( x )) = ELiZglWO = s , v jy ( 45 ) 
ln\Z n (x)\ Ei=iKO m6l W 

Applying Theorem 14.11 to the set A = B(f) and measure /i = P u , we obtain the 
statement of Theorem 14.21 

In the general case the same argument provides only lower bound dp u (x) ^ S(v, 9), 
that implies the lower bound dim h B(f) ^ S(f,9). The inverse inequality is provided 
by the next lemma. □ 

Lemma 4.3 Suppose the space X N is equipped with the metrics (I4.3p . Then for any 
measure v £ M\(X) and e > there exists a neighborhood 0(f) such that Hausdorff 
dimension of the set 

A= {x e X N \ \/N 3n> N: 5 X ,„ £ 0(f)} 

does not exceed S(f, 9) + e. 

Proof. Fix a measure v £ M\(X) and an arbitrary positive number x. By McMil- 
lan's theorem there exists a neighborhood 0(f) such that for each positive integer 
n 

#{Z n (x) | 6 X) « £ 0(f)} ^ e< H ^*\ (4.6) 

Decrease this neighborhood in such a way that, in addition, for every measure 5 £ 0(f) 
the next inequality holds: 



In 9(i) < ln e ( i ) + K - 



i=i i=i 
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Then for every cylinder Z n (x) satisfying the condition 8 xn G 0(v) we have the estimate 

\ z n{x)\ = Y[0(x t ) = expl y^lnfl(xt) V = expl ny^5 x>n (i) lnfl(z) I < 
t=i ^ t=i > ^ %=i ' 

< exp< n ^v(i)\n9(i) + nx\. (4.7) 
^ i=i > 

For any positive integer N the set A is covered by the collection of cylinders 

CO 

W,v = |J {Z n (x) I ^eO(i/)}. 



n=N 



Evidently, the diameter of this covering goes to zero when N increases. Now we can 
evaluate mes(Z^Ar,a) by means of formulas (14. 6 j) and (14.71) : 



mes(U N ,a) = \ Z n( x T < /Z e n(H(u)+x) expjcm ^ v(i) \nO(i) + anxX 

Z n (x)£U N n=N ^ i=l ' 

co s s r r \ 

= exp < n I — u(i) In z/(z) + a u(i) In + x + ax J 

n=iV ^ ^ i=l i=l ' 



(4- 



If a > S^i/, 0), then we can choose so small x > that the last exponent in braces is 
negative, and all the sum (I4.8P goes to zero as N — > oo. Therefore Hausdorff measure 
(of dimension a) of the set A is zero, and hence dim# A does not exceed a. □ 

We will say that a sequence of empirical measures 6 XtTl condenses on a subset V C 
M\(X) (notation 5 x>n >- V) if it has at least one limit point in V. 

Similarly to the famous large deviations principle by Varadhan [U [14] , it is natural 
that the next theorem be named the maximal dimension principle. 

Theorem 4.4 Let the space X N be equipped with the cylinder 8-metrics (I4.3p . Then 
for any nonempty subset V C M\{X) 

dim^ {x G X N | 5 Xj7l yV} = sup S(u, 6). (4.9) 

Proof. The set A = {x G X n | 5 XiU >- V} contains basins of all measures v G V. So 
by Theorem 14.21 its dimension is not less than the right hand side of (14. 9 j) . 

It is easily seen from the definition (14. 4p of the Billingsly entropy S(u, 6) that it 
depends continuously on the measure v G M\{X\ Consider the closure V of V. 
Obviously, it is compact. Fix any e > 0. By Lemma [4.31 for any measure v G V there 
exists a neighborhood 0{v) such that 

<\\m H {x G X N | 5 x>n >- 0(u)} ^ S{v,6)+e «C supS(u 1 6)+e. (4.10) 

uev 

Pick out a finite covering of V composed of neighborhoods of this sort. Then the set 
A = {x G X N | S x>n y V} will be covered by a finite collection of sets of the form 
{x G X N | 5 Xtn y 0(u)} satisfying (14. lOj) . By the arbitrariness of e this implies the 
statement of Theorem 14.41 □ 
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A very similar to Theorem 14.41 result was proved by Billingsley in [21 Theorem 7.1]. 

Suppose that a certain subset H C X is specified in the set X = {1, . . . , r}. In this 
case the subset S N C X N will be named the generalized Cantor set. It consists of those 
sequences x — (xi, x 2 , ■ ■ ■ ) in which all x t G H. 

Theorem 4.5 7/ i/ie space X N is equipped with the 9-metrics ( 14 .3p £/ien Hausdorff di- 
mension of the generalized Cantor set S N coincides with the unique solution of Moran's 
equation 

5>(i)' = l. (4.11) 

This theorem was first proved by Moran in 1946 [12] for generalized Cantor subsets 
of the real axis and afterwards it was extended by Hutchinson [10] to the attractors of 
self-similar geometric constructions in W . Let us show how it can be derived from the 
maximal dimension principle. 

Proof. Let s be the solution to Moran's equation. Introduce a probability distribu- 
tion v on X, setting v{i) = 9{i) s for i £ 5 and u(i) = for i 5. Then 

Consider the set 7>(z/) flH N . It has the unit measure with respect to the distribution 
P v = u N . Besides, for every point x G B{y) fl H N by (14. 5 j) we have the equality 
dp v (x) = S(v,9). In this setting it follows from Theorem 14.11 and formula (14.121) that 

dim H (B(v) n H N ) = S{u, 9) = s. (4.13) 

Denote by V the collection of all probability measures on X supported on Scl. 
Evidently, for each point x G S N all the limit points of the sequence 6 x>n belong to V. 
Hence, we can apply Theorem 14.41 that implies 

dim H E N < dim^ {x G X N \ S x>n y V] = sup S(u, 9) = sup ^^1^ ■ ( 4 - 14 ) 



Note that for every measure v G V 



ieS iGS i/(i)>0 ' 




where we have used concavity of the logarithm function. It follows that the right hand 
side in ( 14.141) does not exceed s. Finally, comparing ( 14. 13ft and (14. 14ft . we obtain the 
desired equality dim# S N = s. □ 



5 Brunching processes 

First let us introduce the basic notions about the simplest Galton- Watson brunching 
process. 

Suppose that a random variable Z takes nonnegative values k G Z + with probabili- 
ties pk- The Galton-Watson brunching process is a sequence of integer- valued random 
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variables Z Q , Zi, Z 2 , . . . such that Z = 1, Z\ — Z, and further each Z n+ i is defined 
as the sum of Z n independent counterparts of the random variable Z. In particular, 
if Z n = then Z n+ i = as well. Usually Z n is thought of as the total number of 
descendants in n-th generation of a unique common ancestor under the condition that 
each descendant independently of others gives birth to Z children. 

It is known that in some cases the posterity of the initial ancestor may degenerate 
(when starting from a certain n all Z n are zeros) and in other cases it can "flourish" 
(when Z n grows exponentially). The type of behavior of the brunching process depends 
on the mean number of children of any individual 

oo 

m = E Z = kpk 

k=0 

and on the generating function of that number 

oo 

/(*) = h(s) = J2p^ k - 

Obviously the restriction of the function f(s) to the segment [0, 1] is nonnegative, 
nondecreasing, convex, and satisfies /(l) = 1 and f'(l) = m. 

In the theory of brunching processes (see, for instance, jH [9]) the following state- 
ments have been proved. 

Theorem 5.1 The generating functions of the number of descendants in n-th gener- 
ation 

oo 

f n ( s ) = Y^P{Z n = k}s k 

k=0 

satisfy the recursion relation f n +i(s) = f(f n (s)). 

Theorem 5.2 If m ^ 1 then the brunching process degenerates almost surely (except 
the case when each individual gives birth to exactly one child). If m > 1 then the 
probability q of degeneration is less than 1 and coincides with a unique nonunit root of 
the equation f(s) = s on the segment [0, 1]. 

Theorem 5.3 If m > 1 and EZ 2 < oo then the sequence W n = Z n /m n converges 
almost surely to a random variable W such that P{W > 0} = 1 — q. If m > 1 and 
EZ 2 = oo then for any number wl < m with probability 1 — q 

lim Z n /m n < oo, lim Z n /[m') n = oo 

n— >oo n— >co 

(here q is the probability of degeneration of the brunching process). 

Thereby, in the case m > 1 there is an alternative for the total number of descen- 
dants Z n : either it vanishes at a certain moment n Q (with probability q < 1) or it is 
asymptotically equivalent to Wm n (with the complementary probability 1 — q), where 
the random variable W > does not depend on n (except the case EZ 2 = oo, when 
only the logarithmic equivalence In Z n ~ In m n is guaranteed) . All other types of the 
descendants' number behavior have zero probability. 

We will exploit these theorems in the study of colored brunching processes. 
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Suppose now that each individual may give birth to children of r different colors (or 
r different genders, if one likes). We will suppose that the posterity of each individual 
in the first generation represents a random set X containing random number k\ of 
children of the first color, random number k 2 of children of the second color, and so on 
up to k r children of color r. All elements of X (including elements of the same color) 
are treated as different. The ordered array k = (ki, k 2 , . . . , k r ) G will be called the 
color structure of the set of children X. Denote by p k the probability of birth of the 
set X with color structure k = (ki, k 2 , . . . , K). Naturally, all the probabilities pu are 
nonnegative and 

kez r + 

If an individual x\ gave birth to x 2 , then x 2 gave birth to x 3 , and so on up to an 
individual x n , then the sequence x = (x±, . . . , x n ) will be called the genetic line of 
length n. 

Let us construct a new branching process taking into account not only the total 
number of descendants but also the color of each individual and all its upward and 
downward lineal relations. This process may be thought of as a random genealogical 
tree with a common ancestor in the root and all its descendants in the vertices, where 
each parent is linked with all its children. In the case of degenerating population its 
genealogical tree is finite, and in the case of "flourishing" one the tree is infinite. 

Formally it is convenient to define such a process as a sequence of random sets X n 
containing all genetic lines of length n. As the first set X-y we take X. The subse- 
quent X n are built up by induction: if X n is already known, then for all genetic lines 
(xi, . . . ,x n ) G X n define disjoint independent random sets of children X(xy, . . . ,x n ), 
each with color structure distribution as in X, and put 

A n +i = • • • , x n , x n+ i) | (x±, . . . , x n ) G X n , x n+ \ G X(xy, . . . , £„)}. 

The built in such a way stochastic process Xi, X 2 , A3, . . . will be referred to as the 
colored branching process (or unconditional colored branching process if one wishes to 
emphasize that the posterity of any individual is independent of its color and geneal- 
ogy). 



6 The McMillan theorem for colored branching 
processes 

Consider a colored branching process X±, X 2 , ... determined by a finite collection 
of colors Q = {l,...,r} and a probability distribution {p k \ k G where k = 

(hi, . . . , k r ) is the color structure of each individual's set of children X. We will always 
think that X\ is generated by a unique initial individual. 

For any genetic line x = (x±, . . . ,x n ) G X n define the spectrum 5 Xtn as the corre- 
sponding empirical measure on Q by the rule 

r / .\ #{t\g(Xt)=i} n , v 

d x ,n(i) = , i e fi, (6.1) 
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where g(x t ) denotes the color of x t . In other words, 5 x>n (i) is the fraction of individuals 
of color i in the genetic line x. Our next goal is to obtain asymptotical estimates for 
cardinalities of the random sets 

{x = (xi, ...,x n )eX n \ 8 XtU G 0(u)}, 

where 0(v) is a small neighborhood of the distribution v on the set of colors Q. 
Denote by p,(i) the expectation of members of color % in X: 

MO = ^2 kiPk > i = 1, ■ • • , r. (6.2) 

Provided all are finite, the vector p = (/i(l), . • . , fJ>(r)) can be regarded as a 
measure on the set of colors Q. This measure generates the measure p n on Q n as 
Cartesian product. 

Define a mapping G : X n — > Q n by means of the formula 

G(xi, ...,x n ) = (g(xx), . . . , g(x n )) , 

where g(xt) is the color of Xt- 

Lemma 6.1 For any u = (u\, . . . , u n ) G Q n we have 

n 

E#{x G X n | G(x) = u] = \[p{uJ t ) = p n (co). (6.3) 

t=i 

Proof. Cast out the last coordinate in uj and let oj' = (cox, . . . , u> n _i). For any 
genetic line (xi, . . . , x n -i) G X n _ 1; by virtue of the definition of unconditional colored 
branching process we have 

E#{x n G X(xi, . . . ,x n _i) | g(x n ) = u n } = n{u n ). 

Evidently, this expression does not depend on x' — (xi, . . . , x n _i). Therefore, 

E#{x G X n | G{x) =tu} = E#{x' G X n _x | G(x') = u/}/i«). 

Repeated application of the latter equality gives (I6.3p . □ 
Define for the measure p from (16.21) the Kullback action 

p(u, n) = J] In r, G Afx(fi), 

where Mi ($7) is the set of all probability measures on Q. This formula is a copy of 

Theorem 6.2 Suppose X\, X 2 , ... is an unconditional colored brunching process with 
finite collection of colors Q. Then for any e > and probability measure v G Mi(fi) 
t/iere exists a neighborhood 0{y) C Mi(fl) suca that for all natural n 

E#{x G X n | ^ n G 0(i/)} < e »H»(".M)+e). (6.4) 

On t/ie ot/ier nanc?, /or any e > and any neighborhood 0{v) 

E#{x G X n | 4,n e 0(u)} ^ e <-pM-e) as n ^ oo. ( 6 .5) 
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If p{y, p) = +00, the expression —p(v,p) + e in (16 .4p should be treated as an 
arbitrary negative real number. 

Proof. It follows from (16.11) that for every genetic line x G X n its spectrum 5 x ^ n 
coincides with the empirical measure 8 UiTl , where u = G(x). Therefore, 

#{ieX B |^ n GO(i/)}= Yl #{xeX n \G(x) = u}. (6.6) 
It follows from (16. 3 j) and (16. 6p that 

E#{x g x n I 4,n e 0(1/)} = p n {co g ft" I <L, n e 0(1/)}. 

The latter equality converts estimates (16. 4p and (16. 5 j) into already proved estimates 
(13. ip . (13. 2 p from the large deviations principle. □ 

Remarkable that the last reference to the large deviations principle serves a unique 
"umbilical cord" linking the first three sections of the paper with others. 

Now we are ready to state an analog of the McMillan theorem for colored branching 
processes. Let q* be a probability of degeneration of the process (probability of the 
occasion that starting from a certain number n all the sets X n turn out to be empty). 

Theorem 6.3 Suppose X\, X2, ... is an unconditional colored brunching process with 
finite collection of colors Q. Then for any e > and any probability measure v G M\ (Q) 
there exists a neighborhood 0(v) C Mi(fl) such that for almost sure 

#{x G X n I 6 x , n G 0(z/)} < e "(-P(^)+e) as n 00. (6.7) 

On t/ie oi/ier hand, if p(v,p) < then for any neighborhood 0(p) and positive e the 
estimate 

#{x G X n I 4,n e 0(z/)} > e^-"^ - ^ as n^oo (6.8) 

/10/ds probability 1 — q* (or almost surely under the condition that our branching 
process does not degenerate) . 

Proof. Application of Chebyshev's inequality to (16 .4p gives 

P{#{x G X n I 5 x>n G 0(i/)} ^ e n (-^ +2£ )} < e- n£ . 
Sum up these inequalities over all n ^ N: 

-Ne 

P{3n > N: #{x G X n \ 5 x , n G 0(u)} > c <-pM+^)) <: ^—^- £ . 

This implies (16 ,7p with constant 2e instead of e, that does not change its sense. 

Proceed to the second part of the theorem. Let k = —p(u,p) — e and the number 
£ be so small that x > 0. By the second part of Theorem 16.21 for any neighborhood 
0(u) there exists A" such that 

E#{xeX N \6 X)N eO(y)}>e Nit . 

Without loss of generality we may assume that 0(v) is convex. 
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Construct a Galton-Watson branching process satisfying the conditions 

Zi = #{xeX N \8 x , N eO{u)}, (6.9) 

Z n ^ #{x G X nN | 5 x>nN G 0(u) }, n = 2, 3, . . . (6.10) 

Let the random variable Zi be defined by (16. 9p . For n > 1 define Z n as a total number 
of genetic lines 

(xi, . . . 

such that the spectrum of each segment (xkN+i, ■ ■ ■ , %(k+i)N) belongs to 0{y). In other 
words, we will treat as "individuals" of the process Zi, Z 2 , Z 3 , ... those segments 
(xkN+i, ■ ■ ■ 5 ^(fc+i)Af) of genetic lines of the initial process whose spectrum lies in 0(u). 
Then (I6.10p follows from convexity of 0(u), and from unconditionality of the initial 
colored branching process it can be concluded that the sequence Zi, Z 2 , ... in fact 
forms a Galton-Watson branching process. 

By construction, EZx > e N *. In this setting Theorem 15.31 asserts that there is an 
alternative for the sequence Z n : either it tends to zero with a certain probability q < 1 
or it grows faster than e nNK with probability 1 — q. In the second case, by virtue of 

#{x G X nN | 6 XinN G 0(u)}e' nNK -»■ oo n -»■ oo. (6.11) 

To finish the proof we have to do two things: verify that in fact (16.111) is valid with 
probability 1 — q* and get rid of the multiplier N there. To do this we will exploit two 
ideas. First, if the colored branching process X%, X 2 , ... were generated by m initial 
individuals instead of the unique one, then (16. lip would be valid with probability at 
least 1 — q m . Second, if one genetic line is a part of another and the ratio of their 
lengthes is close to 1 then their spectra are close as well. 

Obviously, the total number of individuals in the n-th generation of the initial 
branching process X±, X 2 , X%, . . . equals \X n \. The sequence of random variables \X n \ 
forms a Galton-Watson brunching process with probability of degeneration q*, that 
does not exceed q. Therefore, the sequence |Jf n | grows exponentially with probability 
1-q*. 

Consider the colored brunching process X k+1 , X k+2 , X k+3 , . . . obtained from the 
initial one by virtue of truncation of the first k generations. It represents a union of 
\X k \ independent brunching processes generated by all individuals of k-th generation. 
It satisfies (16. lip with probability at least 1 — g' Xfc L Hence for the initial process with 
even greater probability we obtain the condition 

#{x G X k+nN | 5 Xjk+nN G 0» } e~ nN * -> oo n ^ oo, (6.12) 

where 0*(v) is an arbitrary neighborhood of v containing the closure of 0[y). 

Suppose the sequence \X n \ grows exponentially. Then for every m G N define the 
numbers 

ki = min{/c : \X k \ ^ m, k = i mod N}, i = 0, 1, . . . , iV — 1. 

For each k = ki, the condition (16.121) holds with probability at least 1 — q m , and in 
common they give the estimate 

#{x G X n | 8 Xtn G 0*(u)} > e n>t as n — Y oo 
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with probability at least 1 — Nq m . By virtue of the arbitrariness of m this estimate is 
valid almost surely (under the condition \X n \ — > oo, which takes place with probability 
1 — q*). It is equivalent to ( 16. 8p . □ 



7 Dimensions of random fractals (upper bounds) 

We proceed investigation of the colored brunching process X±, X2, . . . with finite 
collection of colors Q = {1, . . . ,r}. Let us consider the corresponding set of infinite 
genetic lines 

^00 = {x = (xi,x 2 ,x 3 ,...) I (xx,...,x n ) e X n Wne N}. 
Define the cylinder ^-metrics on 

n 

dist(x,y) = Y\_0(xt), n = inf {t \ x t ^ y t } - 1, (7.1) 

t=i 

where the numbers 0(1), . . . , 9(r) are taken from (0, 1). 

We will be interested in Hausdorff dimensions of both the space X^ and its various 
subsets defined in terms of partial limits of empirical measures on Q (those measures 
are called spectra and denoted 5 XjV ,). If the colored brunching process degenerates then 
Xqo is empty. Therefore of interest is only the case when m = E|Jfi] > 1 and the 
cardinality of X n increases with rate of order m n . 

As before, denote by /i(i), where j e H, the expectation of individuals of color % in 
the random set X\. It will be always supposed that \x{i) < 00. Consider any probability 
measure v G Mi(ft). It will be proved below that the dimension of {x G X^ \ 5 x>n — > u} 
can be computed by means of the function 

d(u, * 9) = P[V f = —$L . (7.2) 

We will name it the Billingsley-Kullback entropy. 

In (17. 2p the numerator is the Kullback action and the denominator is negative. If 
fi is a probability measure on Q then the Kullback action is nonnegative. But in our 
setting this is not the case since m = + . . . + /i(r) > 1. In particular, if fi(i) > 
for all i G fl then the Kullback action will be negative, and the Billingsley-Kullback 
entropy positive. Note, in addition, that if /i(l) = ... = /i(r) = 1 then — p(z/, /i) is 
equal to Shannon's entropy H(i>), and the whole of Billingsley-Kullback entropy turns 
into the Billingsley entropy (14.41) . 

Lemma 7.1 Let the space X^ of infinite genetic lines be equipped with the metrics 
( 17. ip . Then for any probability measure v G Mi(Q) and any e > there exists a 
neighborhood 0(v) such that Hausdorff dimension of the set 

A={xeX OD \MN3n>N: 5 x , n G 0{v)} 

does not exceed g?(z/, /1, 9) + e almost surely. 
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Proof is carried out in the same manner as in Lemma 14.31 Take any x > 0. By 
Theorem 16.41 there exists a neighborhood 0{y) such that almost surely 

#{x G X n | 6 Xtn G 0{v)} < e »(-p(">A0+*) as n ^ ^ ( 7 . 3 ) 
Reduce this neighborhood in such a way that in addition for all measures 5 G 0(v), 

r r 

In 9(i) < v i l ) ln Q tf) + x - 
i=i i=i 

Then for each cylinder Z n (x) satisfying the condition 5 x n G 0(z/) we have the estimate 
\ z n{x)\ = Y[d(x t ) = expl Y]hi9(x t ) \ = expl n y~] 5 Xtn (i) In 9(i) \ < 

t=l W=l ^ ^ 8=1 ' 

< exp-j n u{i) ln9(i) + nx > . (7.4) 
^ i=i ^ 

For every natural N the set A is covered by the collection of cylinders 

oo 

U N = [J{Z n (x) I 5 x , n eO(v)}. 

n=N 

Evidently, the diameter of this covering tends to zero as iV — > oo. Hence mes(WAr,a) 
can be estimated by virtue of formulas (17. 3p and (J7T4J) : 



mes(Wjv,a) = Yl \ Z n( x T < £ e <- P M+*) exp ! an v(i) ln 9(i) + anxX = 

Z n {x)eU N n=N { i=l ' 

= ^ expjn ^ v(i) ln + a ^ ln 9(i) + x + ax^ j. (7.5) 

If a > <i(z/, /i, 0) then x can be chosen so small that the last exponent in braces is 
negative, and all the sum (17. 5p tends to zero as N — > oo. Therefore Hausdorff measure 
(of dimension a) of the set A is zero, and its dimension does not exceed a. □ 

As before, we say that the sequence of empirical measures 8 XjTl condenses on a subset 
V C Mi(fi) (notation 8 x>n >~ V) if it has a limit point in V. 

Theorem 7.2 Let X\, X2, X3, ... be an unconditional colored brunching process with 
finite set of colors Q, and the set X^ of all infinite genetic lines equipped with the 
cylinder metrics ( 17. ip . Then for any subset V C Mi(fi) almost surely 

dim#{x G Xoo \ 5 XiTl >- V} < sup d(u, fi, 9) . (7.6) 

In particular, dim# s for almost sure, where s is a unique root of the "Bowen 

equation" 

r 

j>(i)0(O* = 1- (7-7) 



i=l 
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Proof. It follows from the definition of the Billingsley-Kullback entropy d(v, /i, 9) 
that it depends continuously on the measure v G Mi(Q). Let V be the closure of V. 
Obviously, it is compact. Take an arbitrary e > 0. By Lemma 17.11 for any measure 
v G V there exists a neighborhood 0(u) such that almost surely 

dim H {x G Xoo | 5 x>n y 0{v) } ^ <i(i>, fi,9) + e < sup d(z/, //, 6 1 ) + e. (7.8) 

Choose a finite covering of V by neighborhoods of this kind. Then the set {x G Xoo | 
8x,n y V} will be covered by a finite collection of sets of the form {x G | 5 Xjn y 
0{y)} satisfying (17. 8ft . By the arbitrariness of e this implies the first statement of 
Theorem 17.21 

Let s be a solution of equation ( I7.7j) . Note that for any measure i/ G Mi(f2), since 
the logarithm function is concave, 



tr tr mo tr ko 



U(i)>0 v ; ) 

Consequently, d(u, //, 0) ^ s. Now the second part of our theorem follows from the first 
one if we take V = M^Q). □ 

Remark. In fact the "Bowen equation" is an equation of the form P(sip) = 0, 
where P(sip) is the topological pressure of a weight function sip in a dynamical system 
(more detailed explanations can be found in [13]). If we replace the topological pressure 
P(sip) by the spectral potential 

r 

\(s<p,fi) = ln^V^V(i), where (p(i) = In 0(0, 
i=i 

then the Bowen equation turns into the equation \(stp,/j) = 0, which is equivalent to 

(EZD. 



8 Block selections of colored brunching processes 

Let £i, ^2, £3, • • • be a sequence if independent identically distributed random variables 
taking values or 1 (independent Bernoulli trials). 

Lemma 8.1 If < p' < p < 1 and P{£j = 1} ^ p, then 

P{£l + ■■■ + & ^P'k} -> 1 ask^oo (8.1) 

uniformly with respect to the probability P{£j = 1} ^ p. 

Proof. In the case P{£j = 1} = p this follows from the law of large numbers. If 
P{£i = 1} increases then the probability in the left hand side of (18. ip increases as 
well. □ 
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Consider a colored branching process X 1; X 2 , ... with finite set of colors f2 = 
{1, . . . , r}. Each X n consists of genetic lines (xi, x 2 , • • • , x n ) of length n, in which every 
subsequent individual has been born by the previous. Fix a (large enough) natural N. 
We will split genetic lines of length divisible by N into blocks of length TV: 

(xi,x 2 , ■ ■ ■ ,x nN ) = (yi, . . . ,y n ), where y k = (x^^jv+i, . . . , x kN ). 

Each block y k generates an empirical measure 8 Vk (spectrum) on Q by the rule 

#{t\g(x t )=i, (k-l)N <t^kN} 
°vhW = ft > 

where g(x t ) denotes the color of x t . 

A block selection of order N from a colored brunching process X\, X 2 , ... is any 
sequence of random subsets Y n C X nN with the following property: if (yi, . . . ,y n +i) £ 
Y n+ i then (yi, . . . , y n ) G Y n . In this case the sequence of blocks (y\, . . . , y n +i) will be 
called a prolongation of the sequence (yi, . . . , y n ). 

As above (see (16 .2p ). denote by ji{i) the expectation of children of color % born by 
each individual, and by \x the corresponding measure on Q. 

Theorem 8.2 Let X%, X 2 , X3, ... be an unconditional colored brunching process with 
finite set of colors Q and probability of degeneration q* < 1 . If a measure v 6 M\ (fl) 
satisfies the condition p(u, fx) < 0, then for any its neighborhood 0{u) C Mi(fi) and any 
number e > with probability 1 — q* one can extract from the brunching process a block 
selection Y\,Y 2 , ... of an order N such that each sequence of blocks (yi, . . . , y n ) G Y n 
has at least l(N) prolongations in Y n+ i, where 

l(N) = e N(-pM-e)^ ( 8>2 ) 

and the spectra of all blocks belong to 0(u). 

Proof. Fix any numbers p and e satisfying the conditions 

0<p<p + e<l-q*, p(v, //)+£< 0. 

By the second part of Theorem 16.31 for all large enough iV we have 

P{#{x G X N I 6 X , N G 0(u)} > e^-P^)-^ 2 )} ^ p + £ . (8.3) 

Further we will consider finite sequences of random sets X%, . . . , X n ^ and extract 
from them block selections Y\, . . . , Y n of order such that the spectra of all their 
blocks belong to 0(v) and each sequence of blocks (yi, . . . ,y k ) G Yk has at least l(N) 
prolongations in Y^+i. Denote by A n the event of existence of a block selection with 
these properties. Define one more event A by the condition 

#{x G X N I 5 X>N G 0(i/)} > l{N)e m ' 2 . 

It follows from flOJ and Q that P(A) ^ p + e. Evidently, Ac4 Therefore, 
P(-Ai) ^ p + £■ Now we are going to prove by induction that P(A n ) ^ p whenever the 
order N of selection is large enough. Let us perform the step of induction. Assume 
that P(A n ) ^ p is valid for some n. Consider the conditional probability P(A n+ i|yl). 
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By the definition of events A n+ i and A it cannot be less than the probability of the 
following event: there are at least l(N) wins in a sequence of [l(N)e Ne / 2 ] independent 
Bernoulli trials with probability of win P(A n ) in each. Using Lemma [87T1 (with p' — p/2 
and k = [l(N)e Ne ^ 2 ]) one can make this probability greater than 1 — £ at the expense 
of increasing N. Then, 

P(A n+1 ) > P(A)P(A n+1 \A) > (p + e )(l- e ) >p . 

Thus the inequality P(A n ) > p is proved for all n. 

It means that with probability greater than p one can extract from the sequence 
X\, . . . , X u n a block selection Y\, . . . , Y n of order N such that the spectra of all blocks 
belong to the neighborhood 0(u) and each sequence of blocks (yi, . . . , yk) G Yk has at 
least l(N) prolongations in Yfc+i. 

To obtain a block selection of infinite length with the same properties, we will 
construct finite block selections Y\, . . . , Y n in the following manner. Initially, suppose 
that every Yk, where k ^ n, consists of all sequences of blocks (yi, . . . , yk) G XkN such 
that the spectrum of each block lies in 0(u). At the first step we exclude from F n _i 
all sequences of blocks having less than l(N) prolongations in Y n , and then exclude 
from Y n all prolongations of the sequences that have been excluded from Y n _\. At the 
second step we exclude from Y n _ 2 all sequences of blocks having after the first step 
less than l(N) prolongations in the modified Y n _i, and then exclude from Y n _i and 
Y n all prolongations of the sequences that have been excluded from Y n ^ 2 . Proceeding 
further in the same manner, after n steps we will obtain a block selection Yi, . . . , Y n 
such that each sequence of blocks from any Y k has at least l(N) prolongations in Y k+ \. 
Evidently, this selection will be the maximal among all selections of order N having the 
mentioned property. Therefore with probability at least p all the sets are nonempty. 

For every n let us construct, as is described above, the maximal block selection 
Y-[ n \ . . . , Yn n \ From the maximality of these selections it follows that 

y( n ) -j y(n+l) y(n+2) 

Define the sets Y n = f)k> n Yn k ^ ■ Then with probability at least p all of them are 
nonempty and compose an infinite block selection from Theorem 18.21 Since p may be 
chosen arbitrarily close to 1 — q*, such selections do exist with probability 1 — q*. □ 

Theorem 18.21 can be strengthened by taking several measures in place of a unique 
measure v G Mi(fi). 

Theorem 8.3 Let Xi, X 2 , X 3 , ... be an unconditional colored brunching process with 
finite set of colors Q and probability of degeneration q* < 1. If a finite collection of 
measures Vi G Mi(Q), where % = 1, k, satisfy the inequalities p(u i} /i) < 0, then 
for any neighborhoods 0{yi) C Mi(O) and any e > with probability 1 — q* one can 
extract from the brunching process a block selection Y\, Y 2 , ... of an order N such 
that for every % = 1, . . . , k each sequence of blocks (y\, . . . , y n ) G Y n has at least 

/.(AT) = e N(- P (u^)-e) 

prolongations (y\, . . . ,y n ,y) G Y n+ i with the property 5 y G 0(vj). 
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It can be proved in the same manner as the previous one, only now the event 
A n should be understood as existence of a finite block selection Yi, . . . , Y n satisfying 
the conclusion of Theorem 18.31 and the event A should be defined by the system of 
inequalities 

#{x £ X N | 5 X , N £ 0{vi)} > k(N)e N£/ \ i = 1, . . . , k. 
We leave details to the reader. 



9 Dimensions of random fractals (lower bounds) 

Now we proceed investigation of the space of infinite genetic lines 

^oo = {x = (x 1 ,x 2 ,x 3 , . . . ) | (xx, ...,x n )eX n Wn e N}, 

which is generated by an unconditional colored brunching process X\, X 2 , ... with 
finite set of colors Q = {1, . . . , r}. It is supposed that there is a measure 

v= (Mi), M r )) 

on Q, where fi(i) denotes the expectation of children of color i born by each individual, 
and Xqo is equipped with the cylinder ^-metrics (17. ip . 

Theorem 9.1 Let X\, X 2 , X3, ... be an unconditional colored brunching process with 
finite set of colors Q and probability of degeneration q* < 1 . If a measure v 6 M\ (Q) 
satisfies the condition d(v,fi,9) > 0, then with probability 1 — q* for any neighborhood 
0(u) we have the lower bound 

dim H {x e \ 3N Vn > N 5 x>n E 0{u)} ^ d(v,n,6). (9.1) 

Proof. Fix any number a < d(u, fi, 9) and so small e > that 

At a\ ^ p(u^) + 2e 

d\v,uL,9) = — — ; — — > — = — — > a. 9.2 

Then choose a convex neighborhood 0*(u) whose closure lies in 0(v) such that for any 
measure 5 G 0*(u) 

r r 

In 9(i) > v( j) lnd ( i ) - e - ( 9 - 3 ) 

i=i i=i 

By Theorem 18 . 2 1 with probability 1 — q* one can extract from the brunching process 
under consideration a block selection Yi, Y 2 , . . . of order iV such that any sequence of 
blocks (yi, ... , y n ) £ Y n has at least l(N) prolongations in Y n+ i, where 

l(N) = e N (~pM-^ i 

and for each block y%. = (x^-i)n+i, ■ ■ ■ ,x^n) the corresponding empirical measure 5 Vk 
(spectrum) belongs to 0*(u). Exclude from this selection a certain part of genetic lines 
in such a way that each of the remaining sequences of blocks (yi, . . . , y n ) £ Y n would 
have exactly [l(N)] prolongations in Y n+ \. 
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Define the random set 



Yoc = {y = (yi, 2/2, • • • ) | (yi, ■ ■ ■ ,y n ) g Y n , n = l, 2, . . . }. 

Any sequence y = (y±,y2, ■ ■ ■ ) G ^oo consists of blocks of length iV. Having written 
down in order the elements of all these blocks, we obtain from y an infinite genetic line 
x = (x\,X2, ■ ■ ■ ) G Xqo. Denote it as n(y). By the definition of Fqo the spectrum of 
each block y^ belongs to 0*(u). For every point x = ir(y), where y G Y^, the empirical 
measure S x>n ^ is an arithmetical mean of empirical measures corresponding to the first 
n blocks of y, and so belongs to 0*{v) as well. It follows that 

ttQ^) C {xeX OD \3N\/n>N 5 x , n G 0(u)}. (9.4) 

The family of all cylinders of the form Z n ^(x), where x G 7r(y^o), generates some 
cr-algebra on n^Y^). Define a probability measure P on this cr-algebra such that 

P{Z nN {x)) = [l{N)]- n . 

Then for all large enough N, all x G ^(Y^), and all natural n 

P(Z nN (x)) ^ e nN{ P M+2e)^ 

On the other hand, by (19. 3p 

nN f nN , r 

\Z nN (x)\ = Jj0(ar t ) = expl Y]]nB(x t ) > = expl ^ nN 5 x <nN (i) In 9 (i) [ > 

t=l ^ t=l ' ^ i=l 

^ ^ i=i 

It follows from the last two formulas and (19. 2p that 

\Z nN {x)\ a ^ exp|nJVa^JJi/(i)ln0(i) - e) J ^ e nAr(p( ^ )+2e) ^ P(Z nJV (x)). 

Now we are ready to compute the Hausdorff measure of dimension a of the set 
7r(Yoo). If, while computing the Hausdorff measure, we used coverings of ir(Y°a) no ^ 
with any cylinders, but with only cylinders of orders divisible by N, then the last 
formula would imply that such a measure will be at least P(tt(Y 00 )) = 1. Any cylinder 
can be put in a cylinder of order divisible by N such that the difference of their orders 
will be less than N and the ratio of their diameters greater than mm 9(i) N . Therefore, 



mes(7r(Y" 00 ), a) ~^mmd(i) 



Na 



and hence dim# 7r(Yoo) ^ a. 

The set defined in the right hand part of (19. 4p contains ^(Y^). Then its dimension 
is at least a too. Recall that we have proved this fact by means of a block selection 
that exists with probability 1 — q*. By the arbitrariness of a < d(v,/j,,6) this implies 
the desired bound (19. ip with the same probability. □ 
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Theorem 9.2 Let s be a root of the Bowen equation 



i=l 

If s ^ ; t/ien = almost surely. Otherwise, if s > 0, then X^ is nonempty with 
a positive probability, and with the same probability its dimension equals s. 

Proof. The expectation of total number of children of each individual in the brunch- 
ing process generating the set Xoq is equal to m = + . . . + /i(r). If s ^ 0, then 
m ^ 1. In this case by Theorem 15.21 our brunching process degenerates almost surely, 
and = 0. 

If s > 0, then m > 1. In this case by Theorem ref5..2 our brunching process 
is degenerate with a positive probability, and X^ is nonempty. Define a measure 
v G Mi(Q) by means of the equality 

v {i) = fi(i)e(^ s , i g a 

Then, evidently, d(v,/j,,6) = s. By the previous Theorem dim^Xoo ^ s with the same 
probability with which I ro ^ 0. On the other hand, by Theorem 17.21 the inverse 
inequality holds almost surely. □ 



A more general version of Theorem I9.2[ in which the similarity coefficients 0(1), 
. . . , 6{r) are random, is proved in [61 [3, [8j ITT] . 

For every probability measure v G Mi(fi) define a basin Biv) C X^ as the set of 
all infinite genetic lines x = (xi, X2, ^3, . . . ) such that the corresponding sequence of 
empirical measures 5 XtTl converges to v. What is the dimension of B(i/)7 By Theorem 
17. 21 it does not exceed the Billingsley-Kullback entropy d(u, /x, 6) with probability 1. On 
the other hand, the inverse inequality does not follow from the previous results (and, 
in particular, from Theorem 19.11) . To obtain it, we ought to enhance the machinery of 
block selections. 

Lemma 9.3 Let Q\, . . . , Q^r be vertices of a cube in W . Then there exists a choice 
law i: W — > {1, . . . , 2 r } such that if neighborhoods 0(Qi) are small enough, sequences 

S 1 + ... + 6 n 



<L G W and A r 



n 



satisfy the conditions 5 n+ i G 0(Qi(A„)) an d $i ^ 0(Qi) U ... U 0(Q2r), then the 
sequence A n converges to the center of the cube. 

Proof. First consider the case r = 1, when the cube turns to a segment. Let, for 
definiteness, Q± = —1 and Q2 = 1- Set 

i(A) = J 1 ' AS °- (9.5) 
[2, A<0. 

Take any neighborhoods 0(Qi) and 0(Q2) with radii at most 1. Then for any sequence 
5 n satisfying the conditions 5 n+ \ G 0(Qi(A„)) and \5\\ < 2 we have the estimate 
|A n | < 2/n. It may be easily proved by induction. Thus in the one-dimensional case 
the lemma is proved. To prove it in the multidimensional case one should choose a 
coordinate system with origin at the center of the cube and axes parallel to edges of 
the cube and apply the choice law (19.51) to each of the coordinates independently. □ 
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Theorem 9.4 Let X 1; X 2 , X s , ... be an unconditional colored brunching process with 
finite set of colors Q and probability of degeneration q* < 1 . If a measure v G M\ (Q) 
satisfies the condition d(v,p,9) > 0, then with probability 1 — q* 

dim// B(v) = d(v, p, 9). 

Proof. Fix any number a < d(u, p, 9) and so small e > that 

p(i/,//) + 3e 

d(v,fi,6) = — r > > a- 

The set Mx(O) is in fact a simplex of dimension r — 1, where r = |0|. Suppose first 
that v is an inner point of this simplex (in other words, v{%) > for all i G ft). Take a 
small convex neighborhood 0(v) C Mi(Q) such that for any measure 5 G 0(u) 

p(S,fi) < p(y,fi) +e, 

r r 

i=i i=i 

Let Qi, • ■ ■ , Q-p- 1 De vertices of some cube in 0(z/) with center at v. Define for 
them small neighborhoods 0(Qi) C 0{u) as in Lemma 19.31 By Theorem 18.31 with 
probability 1 — q* one can extract from the colored branching process X±, X2, ... a 
block selection Yi, Y2, ... of order iV such that for every i ^ 2 r_1 each sequence of 
blocks (yi, ... , y n ) G F n has at least 

1{N) = e m-pM-2e) 

prolongations (yi, . . . ,y n ,y) G Y n+ i possessing the property 5 y G 0(Qi). 

Exclude from this block selection a certain part of genetic lines so that each of the 
remaining sequences of blocks (yi, . . . , y n ) G Y n would have exactly [l(N)] prolongations 
(yi, . . . ,y n ,y) G Y n+1 , and all these prolongations would satisfy the choice law from 
Lemma [9. 3 [ namely, 

S y eO(Q i(An) ), where A n = Vt^l±jfe , 

Denote by ir(Y 00 ) the set of all infinite genetic lines (x\,X2, ■ ■ ■ ) G Xoo for which 
every initial segment of length nN, has been partitioned into blocks of length N, turns 
into an element of Y n . Then by Lemma [9.31 we have the inclusion 7r(y^o) C B(v). 

Reproducing reasoning from the proof of Theorem 19.11 one can ascertain that the 
dimension of TriY^) is greater than a. Since a can be taken arbitrarily close to d(v, p, 9), 
we obtain the lower bound dim// B(i>) ^ d(i>,p,9). The inverse inequality, as was 
mentioned above, follows from Theorem 17.21 Thus in the case of inner point v G M\ (Q) 
the theorem is proved. 

If the measure v belongs to the boundary of the simplex Mj(fl), then one should 
exclude from Q all elements % with v{%) = 0, and consider the set 

£1' = {i G I v{i) > 0}. 
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Exclude from the brunching process X 1; X 2 , ... all genetic lines containing elements 
of colors not in Q' and denote as XJ, X' 2 , ... the resulting brunching process (with 
the set of colors Q'). The corresponding set of infinite genetic lines X'^ is contained in 
Xoo. It follows from the definition of Billingsley-Kullback entropy d(u,fi,6) that it is 
the same for the sets of colors Q and Q'. Besides, the measure v lies in the interior of 
the simplex Mi (£]'). Therefore, dim# B{v) flX^ = d(u, fi, 9) with the same probability 

as K / z. 

The theorem would be completely proved if the probability of the event X'^ ^ 
was equal to 1 — q*. But it may be less than 1 — q*. This obstacle can be overcome as 
follows. 

Let m! = MO- This is nothing more than the expectation of each individual's 

number of children in the brunching process X[, X' 2 , ... If m! ^ 1 then (12. 4p implies the 
inequality piy^n) ^ 0, which contradicts the condition <i(z/, /i, #) > of our theorem. 
Therefore m! > 1 and, respectively, the probability of the event X'^ = is strictly less 
than 1. Let us denote it q' . 

If the brunching process X' v X 2 , . . . was generated not by a unique initial element, 
but k initial elements, then the probability of X'^ = would be equal to (q') k . Recall 
that the cardinality of X n grows exponentially with probability 1 — q*. If this is the 
case, one can first wait for the event \X n \ ^ k, and then consider separately \X n \ 
independent counterparts of the brunching process X[, X' 2 , ... generated by different 
elements of X n . This trick allows to obtain the bound dim# B(u) ^ d(u,n,6) with 
conditional probability at least 1 — (q') h under the condition \X n \ — > oo. Since k is 
arbitrary, the above mentioned conditional probability is in fact one, and the complete 
probability cannot be less than 1 — q* . □ 
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